324 research outputs found

    MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine

    Full text link
    The increasing computational and memory requirements of Deep Learning (DL) workloads has led to outstanding innovations in hardware architectures. An archetype of such architectures is the novel Versal AI Engine (AIE) by AMD/Xilinx. The AIE comprises multiple programmable processors optimized for vector-based algorithms. An AIE array consisting of 400 processor cores, operating at 1.25 GHz is able to deliver a peak throughput of 8 TFLOPs for 32-bit floating-point (fp32), and 128 TOPs for 8-bit integer (int8) precision. In this work, we propose MaxEVA: a novel framework to efficiently map Matrix Multiplication (MatMul) workloads on Versal AIE devices. Our framework maximizes the performance and energy efficiency of MatMul applications by efficiently exploiting features of the AIE architecture and resolving performance bottlenecks from multiple angles. When demonstrating on the VC1902 device of the VCK190 board, MaxEVA accomplishes up to 5.44 TFLOPs and 77.01 TOPs throughput for fp32 and int8 precisions, respectively. In terms of energy efficiency, MaxEVA attains up to 124.16 GFLOPs/W for fp32, and 1.16 TOPs/W for int8. Our proposed method substantially outperforms the state-of-the-art approach by exhibiting up to 2.19x throughput gain and 20.4% higher energy efficiency. The MaxEVA framework provides notable insights to fill the knowledge gap in effectively designing MatMul-based DL workloads on the new Versal AIE devices.Comment: Accepted as full paper at FPT 202

    Metabolic syndrome and abdominal fat are associated with inflammation, but not with clinical outcomes, in peritoneal dialysis patients

    Get PDF
    BACKGROUND: In the general population, metabolic syndrome (MetS) is correlated with visceral fat and a risk factor for cardiovascular disease (CVD); however, little is known about the significance of abdominal fat and its association with inflammation and medication use in peritoneal dialysis (PD) patients. We investigated the relationship of visceral fat area (VFA) with C-reactive protein (CRP) levels and medication use in PD patients and followed their clinical outcomes. METHODS: In a prospective study from February 2009 to February 2012, we assessed diabetes mellitus (DM) status, clinical and PD-associated characteristics, medication use, CRP levels, components of MetS, and VFA in 183 PD patients. These patients were categorized into 3 groups based on MetS and DM status: non-MetS (group 1, n = 73), MetS (group 2, n = 65), and DM (group 3, n = 45). VFA was evaluated by computed tomography (CT) and corrected for body mass index (BMI). RESULTS: Patients in group 1 had smaller VFAs than patients in groups 2 and 3 (3.2 ± 1.8, 4.6 ± 1.9, and 4.9 ± 2.0 cm(2)/[kg/m(2)], respectively, P < 0.05) and lower CRP levels (0.97 ± 2.31, 1.27 ± 2.57, and 1.11 ± 1.35 mg/dL, respectively, P < 0.05). VFA increased with the number of criteria met for MetS. After adjusting for age, body weight, and sex, CRP and albumin levels functioned as independent positive predictors of VFA; on other hand, the use of renin-angiotensin system blockers was inversely correlated with VFA in PD patients without DM. In the survival analysis, DM patients (group 3) had the poorest survival among the 3 groups, but no significant differences were found between groups 1 and 2. CONCLUSION: This study showed that VFA and MetS are associated with CRP levels but cannot predict survival in PD patients without DM. The complex relationship of nutritional parameters to VFA and MetS may explain these results. The type of antihypertensive medication used was also associated with the VFA. The mechanisms behind these findings warrant further investigation

    AMiBA Wideband Analog Correlator

    Get PDF
    A wideband analog correlator has been constructed for the Yuan-Tseh Lee Array for Microwave Background Anisotropy. Lag correlators using analog multipliers provide large bandwidth and moderate frequency resolution. Broadband IF distribution, backend signal processing and control are described. Operating conditions for optimum sensitivity and linearity are discussed. From observations, a large effective bandwidth of around 10 GHz has been shown to provide sufficient sensitivity for detecting cosmic microwave background variations.Comment: 28 pages, 23 figures, ApJ in press

    AMiBA: scaling relations between the integrated Compton-y and X-ray derived temperature, mass, and luminosity

    Full text link
    We investigate the scaling relations between the X-ray and the thermal Sunyaev-Zel'dovich Effect (SZE) properties of clusters of galaxies, using data taken during 2007 by the Y.T. Lee Array for Microwave Background Anisotropy (AMiBA) at 94 GHz for the six clusters A1689, A1995, A2142, A2163, A2261, and A2390. The scaling relations relate the integrated Compton-y parameter Y_{2500} to the X-ray derived gas temperature T_{e}, total mass M_{2500}, and bolometric luminosity L_X within r_{2500}. Our results for the power-law index and normalization are both consistent with the self-similar model and other studies in the literature except for the Y_{2500}-L_X relation, for which a physical explanation is given though further investigation may be still needed. Our results not only provide confidence for the AMiBA project but also support our understanding of galaxy clusters.Comment: Accepted by ApJ; 8 pages, 3 figures, 5 table

    AMiBA: Broadband Heterodyne CMB Interferometry

    Get PDF
    The Y. T. Lee Array for Microwave Background (AMiBA) has reported the first science results on the detection of galaxy clusters via the Sunyaev Zel'dovich effect. The science objectives required small reflectors in order to sample large scale structures (20') while interferometry provided modest resolutions (2'). With these constraints, we designed for the best sensitivity by utilizing the maximum possible continuum bandwidth matched to the atmospheric window at 86-102GHz, with dual polarizations. A novel wide-band analog correlator was designed that is easily expandable for more interferometer elements. MMIC technology was used throughout as much as possible in order to miniaturize the components and to enhance mass production. These designs will find application in other upcoming astronomy projects. AMiBA is now in operations since 2006, and we are in the process to expand the array from 7 to 13 elements.Comment: 10 pages, 6 figures, ApJ in press; a version with high resolution figures available at http://www.asiaa.sinica.edu.tw/~keiichi/upfiles/AMiBA7/mtc_highreso.pd

    KCNN2 polymorphisms and cardiac tachyarrhythmias

    Get PDF
    Potassium calcium-activated channel subfamily N member 2 (KCNN2) encodes an integral membrane protein that forms small-conductance calcium-activated potassium (SK) channels. Recent studies in animal models show that SK channels are important in atrial and ventricular repolarization and arrhythmogenesis. However, the importance of SK channels in human arrhythmia remains unclear. The purpose of the present study was to test the association between genetic polymorphism of the SK2 channel and the occurrence of cardiac tachyarrhythmias in humans. We enrolled 327 Han Chinese, including 72 with clinically significant ventricular tachyarrhythmias (VTa) who had a history of aborted sudden cardiac death (SCD) or unexplained syncope, 98 with a history of atrial fibrillation (AF), and 144 normal controls. We genotyped 12 representative tag single nucleotide polymorphisms (SNPs) across a 141-kb genetic region containing the KCNN2 gene; these captured the full haplotype information. The rs13184658 and rs10076582 variants of KCNN2 were associated with VTa in both the additive and dominant models (odds ratio [OR] 2.89, 95% confidence interval [CI] = 1.505-5.545, P = 0.001; and OR 2.55, 95% CI = 1.428-4.566, P = 0.002, respectively). After adjustment for potential risk factors, the association remained significant. The population attributable risks of these 2 variants of VTa were 17.3% and 10.6%, respectively. One variant (rs13184658) showed weak but significant association with AF in a dominant model (OR 1.91, CI = 1.025-3.570], P = 0.042). There was a significant association between the KCNN2 variants and clinically significant VTa. These findings suggest an association between KCNN2 and VTa; it also appears that KCNN2 variants may be adjunctive markers for risk stratification in patients susceptible to SCD

    Cardiac Myosin Binding Protein C and MAP-Kinase Activating Death Domain-Containing Gene Polymorphisms and Diastolic Heart Failure

    Get PDF
    OBJECTIVE: Myosin binding protein C (MYBPC3) plays a role in ventricular relaxation. The aim of the study was to investigate the association between cardiac myosin binding protein C (MYBPC3) gene polymorphisms and diastolic heart failure (DHF) in a human case-control study. METHODS: A total of 352 participants of 1752 consecutive patients from the National Taiwan University Hospital and its affiliated hospital were enrolled. 176 patients diagnosed with DHF confirmed by echocardiography were recruited. Controls were matched 1-to-1 by age, sex, hypertension, diabetes, renal function and medication use. We genotyped 12 single nucleotide polymorphisms (SNPs) according to HapMap Han Chinese Beijing databank across a 40 kb genetic region containing the MYBPC3 gene and the neighboring DNA sequences to capture 100% of haplotype variance in all SNPs with minor allele frequencies ≥ 5%. We also analyzed associations of these tagging SNPs and haplotypes with DHF and linkage disequilibrium (LD) structure of the MYBPC3 gene. RESULTS: In a single locus analysis, SNP rs2290149 was associated with DHF (allele-specific p = 0.004; permuted p = 0.031). The SNP with a minor allele frequency of 9.4%, had an odds ratio 2.14 (95% CI 1.25-3.66; p = 0.004) for the additive model and 2.06 for the autosomal dominant model (GG+GA : AA, 95% CI 1.17-3.63; p = 0.013), corresponding to a population attributable risk fraction of 12.02%. The haplotypes in a LD block of rs2290149 (C-C-G-C) was also significantly associated with DHF (odds ratio 2.10 (1.53-2.89); permuted p = 0.029). CONCLUSIONS: We identified a SNP (rs2290149) among the tagging SNP set that was significantly associated with early DHF in a Chinese population

    Mass and Hot Baryons in Massive Galaxy Clusters from Subaru Weak Lensing and AMiBA SZE Observations

    Full text link
    We present a multiwavelength analysis of a sample of four hot (T_X>8keV) X-ray galaxy clusters (A1689, A2261, A2142, and A2390) using joint AMiBA Sunyaev-Zel'dovich effect (SZE) and Subaru weak lensing observations, combined with published X-ray temperatures, to examine the distribution of mass and the intracluster medium (ICM) in massive cluster environments. Our observations show that A2261 is very similar to A1689 in terms of lensing properties. Many tangential arcs are visible around A2261, with an effective Einstein radius \sim 40 arcsec (at z \sim 1.5), which when combined with our weak lensing measurements implies a mass profile well fitted by an NFW model with a high concentration c_{vir} \sim 10, similar to A1689 and to other massive clusters. The cluster A2142 shows complex mass substructure, and displays a shallower profile (c_{vir} \sim 5), consistent with detailed X-ray observations which imply recent interaction. The AMiBA map of A2142 exhibits an SZE feature associated with mass substructure lying ahead of the sharp north-west edge of the X-ray core suggesting a pressure increase in the ICM. For A2390 we obtain highly elliptical mass and ICM distributions at all radii, consistent with other X-ray and strong lensing work. Our cluster gas fraction measurements, free from the hydrostatic equilibrium assumption, are overall in good agreement with published X-ray and SZE observations, with the sample-averaged gas fraction of = 0.133 \pm 0.027, for our sample = (1.2 \pm 0.1) \times 10^{15} M_{sun} h^{-1}. When compared to the cosmic baryon fraction f_b = \Omega_b/\Omega_m constrained by the WMAP 5-year data, this indicates /f_b = 0.78 \pm 0.16, i.e., (22 \pm 16)% of the baryons are missing from the hot phase of clusters.Comment: accepted for publication in ApJ; high resolution figures available at http://www.asiaa.sinica.edu.tw/~keiichi/upfiles/AMiBA7/ms_highreso.pd
    corecore